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Abstract: Let w be an infinite word on an alphabet A. We denote by (ni)j>i the 
increasing sequence (assumed to be infinite) of all lengths of palindromic prefixes of w. In 
this text, we give an explicit construction of all words w such that nj+i < 2nj + 1 for all i, 
and study these words. Special examples include characteristic Sturmian words, and more 
generally standard episturmian words. As an application, we study the values taken by 
the quantity limsupnj+i/nj, and prove that it is minimal (among all non-periodic words) 
for the Fibonacci word. 



1 Introduction 

The purpose of this text is to study infinite words (on an arbitrary, not necessarily finite, 
alphabet A) which have "sufficiently many" palindromic prefixes. The motivation comes 
from diophantine approximation (see below), though this question is also related to physics, 
namely to the spectral theory of discrete one- dimensional Schrodinger operators. Words 
with many palindromic factors can be used in this setting [11], corresponding to the com- 
binatorial notion of "palindrome complexity" (see for instance [1]). On the other hand, 
replacing "whole-line methods" by "half-line methods" in connection with this problem 
leads [4] to the use of words with many palindromic prefixes, like the ones studied below. 

In precise terms, given an infinite word w, we shall denote (in this Introduction) by 
(^i)j>i the increasing sequence of all lengths of palindromic prefixes of with ni = 
corresponding to the empty prefix. The words studied here always have infinitely many 
palindromic prefixes, so we assume the sequence (nj)j>i to be infinite. 

A trivial example of such a word w is any periodic word with a palindromic period. A 
more interesting example is the Fibonacci word w = babbababbabba ... on the two-letter 
alphabet {a, b}, for which the sequence (n^) = (0, 1, 3, 6, 11, . . .) is given by rii = Fj+i — 2 
(where Fi is the i-th Fibonacci number); this follows from [6] (Theorem 5). More generally, 
any characteristic Sturmian word satisfies nj+i < 2nj + 1 for any i, and denoting by 
[0, Si, S2, . . . , ] the continued fraction expansion of its slope we have (see §3.1): 

limsup = limsup[l, 1, Sk, Sk-i, . . . , Si]. (1) 
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In particular, if w is the Fibonacci word then hmsupnj+i/rii is the golden ratio 7 = 
(1 + V5)/2. 

A generalization of characteristic Sturmian words to an arbitrary alphabet has been 
given by Droubay, Justin and Pirillo [8]: these are standard episturmian words. They 
also satisfy nj+i < 2nj + 1 for any i, but there is no easy equation like (1) to compute 

lim sup rii^i/rii. 

In this text, we study the words w with abundant palindromic prefixes in the following 
sense: 

Definition 1.1 An infinite word w is said to have abundant palindromic prefixes if the 
sequence (ni)j>i of all lengths of its palindromic prefixes is infinite and satisfies rii+i < 
2ni + 1 for any i > 1. 

A completely explicit construction of all words with abundant palindromic prefixes is 
given, which generalizes one of the constructions [12] of standard episturmian words. This 
is a strict generalization, i.e., there are words with abundant palindromic prefixes which are 
not standard episturmian. Moreover, our results extend to words such that n^+i < 2ni + 1 
for any sufficiently large integer i; in particular, a general construction of all such words is 
given. 

For any word w, we let 

6(w) = lim sup 

rii 

if w admits infinitely many palindromic prefixes, and 5{w) = 00 otherwise. Then 1/S{w) 
measures the "density" of palindromic prefixes in w. We let V be the set of real numbers 
that can be written 5{w) for some word w (on a suitable alphabet). Moreover, we let Vq 
be the set of all numbers 6{w) obtained from words w with abundant palindromic prefixes. 
The inclusion T>q C X> fl [1, 2] trivially holds, and it is not difficult to prove (see §2.2) that 
(2, +00] C T>. Denoting by U the union of two disjoint sets, the following result holds. 

Theorem 1.2 We have V = VoU (2, +00]. 

Actually, for any word w such that 5{w) < 2, there is a word w' with abundant palin- 
dromic prefixes such that the palindromic prefixes of w satisfy the same recurrence relation 
as those of w' (see Proposition 6.1) and, therefore, d{w) = d{w'). 

The easiest examples of words with abundant palindromic prefixes are periodic words 
(with a palindromic period) and characteristic Sturmian words (for which 6{w) can be 
computed thanks to Equation (1)). Denote by V the set of numbers d{w), for these words 
w. Obviously we have V gVq, and the following theorem shows that this inclusion is an 
equality if we restrict to words with "sufficiently many" palindromic prefixes: 

Theorem 1.3 We have V n [1, Vs] = n [1, ^/^^V{^ [1, ^3]. 
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For a periodic word w with a pahndromic period, we have trivially 5{w) — 1. For 
a characteristic Sturmian word w with slope [0, Si, • • • j ]j Equation (1) allows one to 
compute 6{w). From this it is easy to deduce that the characteristic Sturmian word w 
with minimal value of 6{w) is the Fibonacci word. This shows that 1 and the golden ratio 
7 = (1 + V5)/2 are the two smallest elements in V. Cassaigne studied ([3], Corollary 1 
and Theorem 2) the next elements, and his result (together with Theorem 1.3) yields: 

Theorem 1.4 The smallest elements in Vq (resp. inV) make up an increasing sequence 
(o"n)n>o with ctq = 1 and a\ = j, converging to the smallest accumulation point of Vq 
(resp. ofV). 

In more precise terms, this statement means that "Do H [1, Coo) = {(T„, n > 0}. Moreover 
all an, and Uoo = 1.721 .. ., are given in an explicit way in terms of their continued fraction 
expansion. For instance, writing fn for the periodic repetition mmm . . . — m'^ of a finite 
sequence m, we have: 

CT2 =l + v^/2 = 1.707... = [1,1,2] 
and (73 =(2 + \/l0)/3 = 1.720 ... = [1, l,2Xl]- 

As a corollary, we see that the Fibonacci word has maximal "palindromic prefix density" 
among non-periodic words: 

Corollary 1.5 Let w be an infinite word with S{w) < 7. Then w is periodic. 

For a characteristic Sturmian word w with slope [0, si, S2, . . . , ], Morse and Hedlund 
have computed [14] the recurrence function of w. This gives (see Corollary 1 of [3]) a 
formula for the recurrence quotient g{w) of w, namely g{w) = 2 + limsup[sfc, Sk-i, ■ ■ ■ , Si]. 
Therefore Equation (1) gives in this case: 

2g{w) - 3 
g{w) - 1 

hence (as above) the Fibonacci word has minimal recurrence quotient (equal to {5 + y/5)/2) 
among all characteristic Sturmian words. Rauzy has conjectured [16] that it has minimal 
recurrence quotient among all non-periodic words. Corollary 1.5 is an analogue of this 
conjecture. 

The motivation for this text comes from diophantine approximation. Actually \ {1} 
is equal [9] to the set denoted by So H [1,2] in [10], defined in terms of an exponent that 
measures the simultaneous approximation to a real number and its square by rational 
numbers with the same denominator. In particular. Theorem 2.1 in [10] follows from this 
equality and Theorem 1.3 stated above. 
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This connection between palindromic prefixes and diophantine approximation is due to 
Roy [17]. It allows one to get a purely number-theoretical proof of Corollary 1.5 stated 
above, by applying Davenport-Schmidt's theorem [5] on simultaneous approximation to ^ 
and to the real number ^ obtained (as in [17]) from an infinite word w. 

The structure of this text is as follows. We first explain the notation (§2.1), and prove 
that for any a > 2 there is a word w such that 6{w) = a (§2.2). This explains why the 
rest of the text is devoted only to words w such that 6{w) < 2. 

Then we recall how characteristic Sturmian words (§3.1) and standard episturmian 
words (§3.2) are constructed, with a special emphasis on their palindromic prefixes. In 
Section 4, we construct all words with abundant palindromic prefixes (§4.1). To study 
these words, the key definition is the one of reduced functions, which allows us to state 
(§4.2) the main results on words with abundant palindromic prefixes. Moreover, we explain 
(§4.3) how to compute 6{w) for such a word w, using the associated reduced function ip. 
The proof of the results stated in Section 4 is given in Section 5, using general lemmas 
(§5.1 and 5.2) that might be of independent interest. 

Next we briefiy explain how to generalize the results of Section 4 to words that satisfy 
^i+i < 2nj + 1 for any sufficiently large i (§6.1). This allows to prove (in §6.2) Theorem 
1.2 stated above. 

Theorem 1.3 is proved in §7.1, and the set T>q (resp. V) is studied near ^/3 in §7.3 
(resp. in §7.4); this implies that Theorem 1.3 is optimal. We also define ^-strict words with 
abundant palindromic prefixes in §7.2, and prove that any ^-strict standard episturmian 
word w such that 6{iv) < ^/S is cither periodic or characteristic Sturmian. 

Section 8 contains questions and open problems about words with abundant palindromic 
prefixes. At last, Section 9 is an appendix devoted to the proof of two technical results: 
Proposition 6.2 (stated in §6.2) and Lemma 7.1 (stated in §7.1). These statements concern 
asymptotic properties of the sequence (n^) associated with a word w such that 5{w) < 2. 
They are also useful for the diophantine analogue [9] of this text. 

Acknowledgements: I am very thankful to Jean-Paul AUouche and Boris Adamczewski 
for their help and for pointing out to me crucial references in combinatorics. I would 
like also to thank Damicn Roy and Michel Waldschmidt for their support in the number- 
theoretic counterpart of this paper, and Jacques Justin for many useful remarks. At last, 

1 am indebted to Jimena Sivak for help in the redaction. 

2 Notation and a Peculiar Construction 
2.1 Notation 

Throughout the text, we consider a (finite or infinite) alphabet A, which we assume to be 
disjoint from N* = {1, 2, 3, . . .}. Of course, this is not a serious restriction; it allows us to 
consider ^ U N* as a disjoint union. 
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We denote hy \u\ the length of a finite word u, that is the number of letters in u, and by 
e the empty word (which has length zero) . Given a finite word u — Ui . . .Up with Ui E A 
for any i G {1, . . . ,p}, we denote by u its mirror image Up . . .Ui, in such a way that u is a 
palindrome if, and only if, u = u. We set e = so that e is considered a palindrome. We 
say that a word u' — u\. . . u'^, is a prefix of w if p' < p and Uj = u'j for any j < p', that is if 
there is a word u" such that u — u'u" . We extend this definition to the case where u is an 

infinite^ word UiU2U^ In particular, £ is a palindromic prefix of any (finite or infinite) 

word. 

In the same way, a word u" is a suffix of u if, and only if, there is a finite word u' such 
that u — u'u" . If this happens then either both u and u" are finite, or both u and u" are 
infinite. 

If w and w' are finite words such that w' is a prefix of w, we denote by w'''^w the 
word w" such that w — w'w" . In the same way, \i w — w'w", we write w' = ww"~^. An 
important special case is the following: if w and w' are palindromes and w' is a prefix of 
w, then w' is also a suffix of w and ww'^^w is again a palindrome (of which w is a prefix). 
In this situation, ii w = w'w" then we have ww'^^w = w'w"^ (see Lemma 5.1 below). 

Remcirk 2.1 Let w be a word on the (finite or infinite) alphabet A, such that rij+i < 2nj 
for any i sufficiently large (with the sequence (nj)j>i defined in the Introduction). Then 
only finitely many letters of A occur in w; this follows from Proposition 6. 1 proved below. 
Therefore the interesting case, throughout this paper, is when A is finite. 

2.2 Words with Scarce Palindromic Prefixes 

In this Section, we prove that (2, +oo] C V. This result explains why all words w studied 
in the rest of this text are such that 5{w) < 2. 

Obviously there are words w with only a finite number of palindromic prefixes; they 
satisfy S{w) = oo hence oo G P. Now let a be a real number greater than 2, and choose 
£ > such that 2 + e < a. Denote by {pk)k>o ^ sequence of positive integers such that 
^ tends to a, with ^ > 2 for any k. We define an increasing sequence (nj)j>i in the 
following way. We let rii = 0, ^2 = 1 and if i > 2 is even we let Vj+i be the maximal integer 
such that there exists a multiple of 10^*+% denoted by rij+i, with 2nj < rij+i < {2 + e)ni + l. 
If i > 3 is odd, we let n^+i = p^.^p-. With this definition, we have rij+i > 2ni + 1 for any 
i > 1, and Vi (which is defined only when i is odd) tends to infinity as i tends to infinity. 
This implies limsup ^-^^ = a. 

Now let us construct a word w such that (ni)i>i is exactly the sequence of all lengths 
of pahndromic prefixes of w. We consider an alphabet A — {Sk, A; G N} with 6i ^ 5j when 
i ^ j- We define finite palindromes tTj, of length nj, by tti = £ and, for i > 1: 

f TTj+i = 7rj5j5o*+'~^"*~^5i7rj if rij+i > 2nj + 2 
\ TTj+i = TTj^iTTj if rij+i = 2nj + 1. 

^In this text, we consider only right infinite words. In particular, all palindromes are assumed to be 
finite. 
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Then for any i > 1, tt, is a palindrome written on the alphabet {So, . . . ,5i-i}. It is 

also a prefix of VTj+i, and all palindromic prefixes of iTi^i (except TTj+i itself) are prefixes 
of TTj. The infinite word w defined as the limit of tTj as i tends to infinity satisfies the 
required property: its palindromic prefixes are exactly the TTj's, with rii = \7Ti\. Therefore 
a = limsup ^ = S{w) e V. 

This proves the desired result, namely (2, +00] C V. 

Remcirk 2.2 It is possible to adapt this construction to any fixed finite alphabet containing 
at least two letters. This proves that for any (finite or infinite) alphabet A with at least 
two letters, and for any a e (2, +00], there exists a word w on the alphabet A such that 
S{w) — a. 



In this Section, we recall how to construct characteristic, or standard, Sturmian (§3.1) 
and standard episturmian (§3.2) words, with a focus on the properties of their palindromic 
prefixes. 

3.1 Characteristic Sturmian Words 

In this Section, we recall a construction of characteristic Sturmian words (see [13], Chap- 
ter 2) and properties of their palindromic prefixes. 

We consider the two-letter alphabet A = {a, b}. Let Si, S2, . . . , be an infinite sequence 
of positive integers. Define ctq = a, ai = a'^^~^b and, by induction, cr„ = for any 

n>2. In the terminology of [13] (page 75), ((j„) is the standard sequence associated with 
(si — 1, S2, S3, . . .). For any n > 1, (T„ is a prefix of an+i, therefore the words (7„ tend to an 
infinite word Cq,, called the characteristic Sturmian word with slope a — [0, si, S2, ■ ■ ■]■ 

For n >2 and 1 < p < <s„, the word (T^_j^(J„_2 is a prefix of (T„ = (J^!Lj^(J„_2 (since (j„_2 
is a prefix of (7„_i), hence of c^. Moreover it ends with ba if n is even, and with ab if n is 
odd. As it is a standard word, there exists a palindrome 7fn,p such that 



Actually 7fn,p is even a central word, so it can be written Trabir' for some palindromes tt, tt' 
(see [7]). However, in what follows, we shall use only the fact that the words TTn,p defined 
in this way are palindromes. This fact can be proved directly (see for instance [2], Lemma 



We shall now define a sequence {ni)i>i of palindromic prefixes of Cq. First, for any 
/c > 1 we let tfc = Si + . . . + Sk- Now observe that for any i > Si there is exactly one pair 
{n,p) with n >2 and 1 < p < s„ such that i = + p — 1. Therefore the equality 



3 Sturmian and Episturmian Words 




(2) 



5.3). 



T^n,p = T^tn-i+p-i for n > 2 and 1 < p < s. 



•n 
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defines tt^ in a unique way for i > si (and T^tk-i — T^k,sk is obtained from by removing 
the last two letters). If si > 2, we let tTj = a'~^ for any i G {1, . . . , Si — 1}. Then tTj is 
defined for any i > 1; we have tti = £ and each tTj is a prefix of TTj+i. Moreover all TTj's are 
palindromic prefixes of Cq. 

Actually the TTj's are the only palindromic prefixes of c„. This follows from de Luca's 
result ([6], Theorem 5; see also [8], §3) that tTj+i is the right palindromic closure of VTj^j, 
where 5, G ^ is the letter in tTj+i that comes right after tTj (see §3.2 below). Another proof 
of this result can be obtained by applying Theorem 4.12 proved in this text (see Example 
4.6). 

Since the tt^'s are exactly the palindromic prefixes of c^, we have the equality 5{ca) = 
limsup |7ri+i|/|7ri|. It is not difficult to deduce Equation (1) from this (see [2], Proposition 
7.1). 

Let /c > 3. It is not difficult to prove the relation 

T^tk+i = CTfe'^^TTt^-i-i for any £ G {0, ... , s^+i} (3) 

using (for the case I — s^+i) the identity Uk-i'Ktk-'i. — ^k'^tk-i-i (^^e for instance [2], Lemma 
5.1). Prom Equation (3) immediately follows 

\ T^tk+£+i = T^tk+£ ^tk+e-i ^tk+e for any ^ G {1, ... , Sk+i - 1}. 
We are going now to define a map : N* — > N* U ^ in such a way that, for any i > 1: 

r TTi+i = 7ri7r-(^)7ri if G N*, 
\ TTj+i = ni'ijj{i)7ri if e A. 

The possibility to define inductively, in this way, the palindromic prefixes of using ip 
will be the crucial point in the construction of Section 4. 

Por > 3 we let ip{tk) = tk-i — 1, and if i > is not among the t^s we let = i — 
Then Equation (4) shows that (5) holds for any i > t^. To define the values for 
1 < i < ts, we distinguish between two cases. 

Pirst, let us assume Si — 1. Then tt^ = b^~^ for 1 < £ < i2 and nt^+e = {V'^aYV^ for 
any < ^ < sg- We let ^(1) = "0(^2) = a and V'(«) = i - 1 for i G {2, . . . , - 1} \ {^2}- 
Then Equation (5) holds for any i > 1. 

Now let us assume Si > 2. Then tt^ = a^~^ for 1 < i < ti and Titi+i = {a^^'^bYa"^^'^ for 
any < £ < S2. Moreover Equation (3) holds also for k = 2. We let ipi^) = 0,, V'(^i) = b, 
^(ta) = - 1 and = i - 1 for i G {2, . . . , tg - 1} \ {ti, ^2}- Then Equation (5) holds 
for any i >1. 

3.2 Standard Episturmian Words 

Denote by u'*^+^ the (right) pahndromic closure of a finite word w, that is the shortest 
palindrome of which w is a prefix. Let A = 61S2 ... be an infinite word on an alphabet A. 
Droubay, Justin and Pirillo gave [8] the following definition (see [12], Corollary 2.2): 
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Definition 3.1 The standard episturmian word with directive word A is the limit of the 
sequence {ni)i>i defined by tti — e and tTj+i = (TTj^j)^"*"^ for i > 1. 

The important point here (which will be generalized in §4.1) is that a standard epistur- 
mian word can be constructed as a limit of an infinite sequence of its palindromic prefixes. 

Given A, define a function : N* — >• N* U ^ as follows. For n > 1, let V'(n) = 6n if 
the letter 5n occurs for the first time in A at the n-th position. Otherwise, let ipln) = n' 
where n' is the greatest integer such that 1 <n' <n — l and = Then for any i > 1 
we have ([12], p. 287): 

TTi+i = TTiTT-^^TTi if i>{i) G N* 

and 

TTi+i = 7ri^(i)7ri if ilj{i) G A. 

The crucial remark in what follows is that these equalities could have been taken as a 
definition of the sequence (tt,), and therefore of standard episturmian words. 

Example 3.2 Let Si,S2,... be a sequence of positive integers, and A = {a,b} be a two- 
letter alphabet. The standard episturmian word with directive word A = a^^~^b^^a^''^b^'^ . . . 
is the characteristic Sturmian word with slope [0,si,S2, • • •]• This follows from ^3.1 (see 
also [6], proof of Theorem 5). 

Example 3.3 Let A = {a,b,c} and A = (abc)'^ = abcabcahc. . . . Then the standard 
episturmian word w with directive word A is ([12], Example 2.1) the Tribonacci (or Rauzy 
[15]) word (that is, the fixed point abacabaabacabab . . . of the morphism defined by a ^ ab, 
b ^ ac and c a). The corresponding function ip is given by ip{n) — n — 3 for n > A, and 
ip{n) — Sn for 1 < n < 3. 

4 Words with Abundant Palindromic Prefixes 

In this Section, we give a general construction (§4.1) of all words with abundant palindromic 
prefixes, using functions ■0. Then we define (§4.2) rec?ucec? functions ip; this definition allows 
us to state the main results about words with abundant palindromic prefixes, namely 
Theorems 4.12 and 4.14. At last, we explain in §4.3 how to compute S{w) (for a word w 
with abundant palindromic prefixes) using the associated reduced function ip. 

4.1 A General Construction 

Let -0 : N* ^ N* U ^ be any map such that, for each n> 1: 

either ^(n) e ^ or 1 < t[>{n) < n — 1. 
Define tti = e and, for i > 1: 

TTi+i = TTiTT^^^.^TTi if Jp{l) G N* 
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and 

TTj+i = 7ri'ip{i)Tri if e A. 

It is not difficult to prove by induction tfiat all tt^'s are palindromes, and that TTj is a prefix 
of TTj+i (for instance, if ^(i) G N*, writing TTj = 7r^(j)6j = 6i7r^(i) yields TTj+i = 6i7r^(j)6j = 
7r^(j)6f; the easy Lemma 5.1 stated below can also be used). However, in general there is 
no letter 5i E A such that TTj+i be the palindromic closure of HiSi. 

Definition 4.1 We call word with abundant palindromic prefixes associated with ip, and 
denote by w^, the limit of the sequence (tTj). 

This definition is consistent with the one given in the Introduction since the following 
result holds (it is proved in Section 5 as a consequence of Theorem 4.14 stated below): 

Theorem 4.2 Let w be an infinite word, and (nj)j>i be the increasing sequence (assumed 
to be infinite) of the lengths of its palindromic prefixes (with ni = 0). Then the following 
statements are equivalent: 

(i) We have nj+i < 2nj + 1 for any i > 1 (i.e., w has abundant palindromic prefixes). 

(a) For some function ip, we have w — (i.e., w is the word with abundant palindromic 
prefixes associated with ip). 

Let us study in more details the word with abundant palindromic prefixes associated 
with a map ip. First, let us consider the letter 5j in tTj+i that comes right after TTj. This is 
the first letter of 7r~^7ri+i, the one such that HiSi is a prefix of vTj+i. We have Si — if 
•0(0 G and 5i — 5^(i) otherwise. This explains the following definition: 

Definition 4.3 We call word of first letters associated with ip the word A = Si52 ■ ■ ■ defined 
(for each n > 1) by Sn — i^{n) ifip{n) e A, and Sn — (^^(n) otherwise. 

The assumptions on ip imply G A and tt2 = = Si. For ip{2) there arc two 

possibilities: either -0(2) G A (then tts = V(1)V'(2)^(1) and S2 = ^/'(2)), or ^/'(2) = 1 (then 
TTs = ^(IMI) and52=T/'(l)). 

Already from this example we can see that several functions t/j may lead to the same 
word of first letters A: for instance, taking ip{2) = G A yields the same value of 

62 as taking ip{2) = 1 G N*, but not the same value of tts. Using this example it is not 
difficult to produce functions ^0 and t/j' with the same word of first letters but such that 

7^ w^'- Therefore a word with abundant palindromic prefixes is not given just by 
its word of first letters A, but by a richer structure: the function^ ip. To be precise, ijj 
is given exactly by the word A = 6162 ■ ■ ■ together with the choice, for any n > 1, of an 
integer n' G {0, . . . , n — 1} that satisfies either n' = or 5n' = 5n. If we fix A, then a special 
choice of i/j is obtained by taking for n' the greatest integer n' < n such that Sn' — Sn (and 

^Actually one may restrict to reduced functions, see §4.2 below. 
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n' — ii there is no such integer, i.e., if the letter 5„ occurs for the first time in A at the 
n-th position). For this function ip, the word is the standard episturmian word with 
directive word A (see §3.2). Therefore Definitions 4.1 and 4.3 generahze Definition 3.1 of 
standard episturmian words. 

Remark 4.4 Two distinct functions ip and ip' always lead to distinct sequences (tTj) and 
{tt'^), but may lead to the same word = w^i (see Example 4-8 below). 

Example 4.5 If ip{n) = n — 1 for any n > N then i^N+i = t^n'^^ for any ^ > 0, with 
uj = TT^i^TTjv. Therefore in this case is ultimately periodic, hence periodic with a 
palindromic period (see Lemma 5.6 below). 

Example 4.6 Let A — {a, 6} be a two-letter alphabet, and {sk)k>i be a sequence of positive 

integers. For any k > 1, let tk = Si + . . . -\- Sk if Si > 2 and t^. = Si + . . . + Sk+i if Si = 1. 
In both cases, let to = 1. Moreover, let ip{i) = i ~ 1 if i > 1 is not among toi^ii^2, • • •; 
and ipitk) = tk-i — 1 for any k > 2. If Si > 2, let ipiX) = a and ip{ti) = b; if Si = 1, let 
-0(1) = h andiplti) — a. Then the wordw^ associated with ip is the characteristic Sturmian 
word with slope [0, si, S2, ■ ■ ■]. The function ip, the palindromes tt^ and the sequence (tk) are 
exactly the same as in §5.i (except that the index k in tk is shifted if si — 1). 



Example 4.7 In the previous example, if Sk 
and ip{i) — i — 2 for any i > 3. The word 

4.2 Reduced Functions 



= 1 for any k > 1 then = b, ■ip{2) = a 
— babbab . . . is the Fibonacci word. 



Two problems immediately arise from the construction of words with abundant palindromic 
prefixes. First, are there other palindromic prefixes of than the tt^'s ? Second, can two 
distinct functions ip and ip' lead to the same word w ? 

In general, the answers to both questions are positive, as shown in the following exam- 
ple. This is the reason why reduced functions are studied below. 

Example 4.8 Letip be a function, andi >2 be a integer, such thatil){i-\-l) — ip{i) — i — 1. 
Let bi be the finite non-empty word such that TCi = TTj-ifej. Then tTj+i = Tii-ibf and 7rj+2 = 
TTi-ibj. Now Lemma 5.1 stated below shows that 7rj_i6? is a palindromic prefix of7ri^2 (hence 
ofwjp), of length strictly between those o/ VFj+i and 7ri^2- This gives a palindromic prefix 
of which is not among the Tin's constructed from ip. To avoid this problem, consider 
a function ip' such that ip'in) — ip{n) for n < i, ip'{i -\- 1) — i and ip'{i + 2) — i -\- 1. 
Denoting by (tt'J the sequence of finite palindromes associated with ip' , we have tt'^ = 7r„ 
for n < i + 1, 7r-_|_2 = T^i-ibf and Tr-^g = Tii^ibf. For n > i -\- 3, we let ip'{n) = ip{n — 1) if 
ip{n — 1) < i + 1, and ip'{n) = ijj(n — 1) + 1 otherwise. Then we have tt'^ = iin-i for any 
n > i + 3, and = w^i . In this way the functions ip and ip' define the same word, but the 
family of finite palindromes associated with ip' contains the "missing" palindrome TTj-iftf . 
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Let z/) : N* — > N* U ^ be any function (in the sequel we always assume that, for each 

n > 1, either i/j{n) G ^ or 1 < ilj{n) < n — 1). 

Denote by {tk)k>o the family of all indexes n (in increasing order) such that either 
1 < ipln) < n — 2 or ^(n) e A. This family can be either finite or infinite. We always have 
to = 1, since -0(1) e A. 

Definition 4.9 A function ip is said to be reduced if the associated sequence {tk) satisfies, 
for any k > 1, the following two conditions: 

• 1p{tk) 7^ ip{tk-i)- 

• Either ijj{tk) G A or il){tk) < tk~i- 

In the special case where the family {tk) is finite (i.e., ilj{n) = n — 1 for n sufficiently 
large, see Example 4.5), we assume in this definition that both properties hold for any k 
such that tk exists. 

Remark 4.10 The function ip in Example 4-6 is reduced, and the definition of (tk) given 
there is consistent with the one introduced here. 

Remcirk 4.11 The function ip in Example 4-8 is not reduced. In fact there is an integer 
k such that i + 1 — tk, and we have tk-i < i — 1 — ip{tk). 

In the situation of Example 4.8, we have seen that ip is not reduced, and that the tt^'s 
are not the only palindromic prefixes of w^. Actually both phenomena are equivalent: 

Theorem 4.12 Let : W ^ W U A be a function such that, for each n > 1, either 
ip{n) & A or 1 < ip{n) < n — 1. Then the following assertions are equivalent: 

• The function ip is reduced. 

• The palindromic prefixes of are exactly the iTi 's constructed from ip. 

This theorem will be proved in the next Section (§5.3). It is not difficult to deduce the 
following Corollary (see Example 4.5 and Lemma 5.6). 

Corollary 4.13 Let ip be a reduced function. Then is periodic if, and only if, il^{n) — 
n — 1 for any sufficiently large integer n. 

Let t/) be a reduced function, and (tt^) be the associated sequence of finite palindromes 
(that is, thanks to Theorem 4.12, the sequence of all pahndromic prefixes of w^). Then 
the following assertions are easily seen to be equivalent: 

• For any sufficiently large i we have ■?/'(«) G N*. 

• For any sufficiently large i we have |7ri+i| < 2|7rj|. 
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If these assertions hold then wy, can be written on a finite alphabet. 

In addition to Theorem 4.12, another important property of reduced functions is the 
following generahzation of Theorem 4.2, proved in Section 5 below. 

Theorem 4.14 Let w be an infinite word, and (nj)j>i be the increasing sequence (assumed 
to be infinite) of the lengths of its palindromic prefixes (with ni = 0). Then the following 
statements are equivalent: 

(i) We have nj+i < 2ni + 1 for any i > 1 (i.e., w has abundant palindromic prefixes). 

(ii) There exists a function ip such that w — w^. 

(Hi) There exists a reduced function ip such that w — w^. 

Moreover the reduced function ip in (Hi) is unique. 

It is possible to write down a "reduction" algorithm (generalizing Example 4.8) that 
allows one to obtain, from any function the reduced function ip' such that = w^i. In 
this situation, the construction of §4.1 applied with ip gives a sequence (tTj) of palindromic 
prefixes of w^; with ip' , it gives another sequence (tt^). Theorem 4.12 shows that (tTj) is a 
sub-sequence of (tt-). Again, the "reduction" algorithm allows one to obtain exphcitly the 
full sequence (tt^ from the sub-sequence (tTi). This algorithm is partly used in [9], but in 
the present text we shall not need it; the crucial point here is just the uniqueness of the 
reduced function ip' corresponding to ip. 

Definition 4.15 Let w be a word with abundant palindromic prefixes. The reduced func- 
tion ip in Theorem 4-14 called the directive function ofw. 

Remark 4.16 The uniqueness assertion in Theorem 4-14 immediately follows from The- 
orem 4-i2 and Remark 4-4- 

Now we can put Definitions 4.3 and 4.15 together in the following way: 

Definition 4.17 Let w be a word with abundant palindromic prefixes. We call word of 
first letters associated with w the word of first letters associated with the directive function 
ofw. 

The following property holds: if (tTj) is the sequence of all palindromic prefixes of a 
word w with abundant palindromic prefixes, and A = 5i52 ... is the associated word of 
first letters, then WiSi is a prefix of TTj+i for any i > 1. 
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4.3 Computation of 6{w) using Reduced Functions 



Definition 4.18 With any reduced function ijj we associate the increasing sequence of 
non-negative integers {ni)i>i defined by ni — and, for all i > 1: 

Ui+i = 2ni - n^i^i) ifip{i) G N* 

and 

Ui+i — 2ni + 1 if '^{i) e A. 

Theorem 4.12 shows that Ui is the length of the i-th pahndromic prefix of w^. In the 
same way, we introduce the following definition so that 5{ip) — S{w^): 

Definition 4.19 For any reduced function ip we let 5{ip) = limsup^^^, where (ui) is 
associated with ip as in Definition 4-18. 

This definition of 6{iIj) is completely elementary. It is useful because of the following 
fact: for a word w with abundant palindromic prefixes, we have 6{w) = 6{ip) where ip is 
the directive function of w (see Definition 4.15). 

5 Proof of the Main Results 

5.1 General Lemmas about Palindromic Prefixes 

The first lemma is very easy, and sufficient to prove half of Theorem 4.12 (see §5.3 below). 

Lemma 5.1 Let p and u he two words, such that p and pu are palindromes. Then pn^ is 
a palindrome (and so is, by induction, the word pu"' for any n>2). Similarly, if p and up 
are palindromes then u"p is a palindrome for any n > 0. 

Proof: If p and pu are palindromes then we have p — p and up — pu hence 

pu^ — uup — upu — pu^ 

The case where p and up are palindromes is analogous. This concludes the proof of Lemma 
5.1. □ 

In particular, in this situation pu and pu^ are palindromes, one is a prefix of the other, 
and the quotient of their lengths is less than 2 (or equal to 2 when p is empty). The 
following lemma gives a kind of converse to this phenomenon (at least in the case n' — n). 

Lemma 5.2 Letw be an infinite word, andn, n' , n" be integers such thatn' < n" < n+n' . 
We assume that the prefixes of w with lengths n, n' , n" are palindromes, denoted by a, a' 
and a" respectively. Let oq be the prefix of w of length n + n' — n" . Then the following 
holds: 
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• There is a word b such that a — a^h and a" — a'b. 

• If n" > n — n' then is a palindrome. 

Remark 5.3 This lemma will be used only when n" > n — n' , and in this case the first 
property will be written 

a" = a'a^^a 

since gq is both a suffix of a' and a prefix of a. Moreover, an important special case is when 
n — n'. The lemma then reads: if a and a' are palindromes, with n <n" < 2n, then ao is 
a palindrome and we have a — a^b and a" — a^b^ . 

Proof of Lemma 5.2: As a' is a prefix of a", there exists a word b such that a" = a'b. 
The word 6 is a suffix of a", therefore its mirror image 6 is a prefix of a" (hence also of w) 
since a" is a pahndrome. Now b has length n" — n' < n, therefore is a prefix of a. As a 
is a palindrome, 6 is a suffix of a: there exists a word c such that a — cb. It is clear that 
c = ao is the prefix of w of length n + n' — n" . 

Assume now n" > n — n', and let us show that oq is a palindrome. Let 1 < i < 
{n + n' — n")/2 ; then we have i <n' hence: 

by using successively that a, a" and a' are palindromes. This concludes the proof of Lemma 
5.2. □ 

Lemma 5.4 Let w be an infinite word. Let n' < n" be two consecutive lengths of palin- 
dromic prefixes of w; let us denote by n' and n" the corresponding prefixes, with tt" = tt'lu 
for some word cu. Then any palindromic prefix n of w such that n' < |7r| <n'-\- n" can be 
written tt'cu^ with t > 0. 

Proof: Assume there is a prefix tt of w, of length n, which contradicts the lemma and 
has minimal length. As n' and n" are consecutive, we have n > n" . Lemma 5.2 gives a 
palindromic prefix ttq of w of length n— \uj\ > n', such that tt = ttqU!. This contradicts the 
minimality of tt, and concludes the proof. □ 

Lemma 5.5 Let w be an infinite word. Let uq < Ui < n2 be three consecutive lengths of 
palindromic prefixes of w; let us denote by ttq, tti and 1:2 the corresponding prefixes. Then: 

• Either n2 — ttitTq^tti, 

• Orn2 > no + rii. 

Proof: If n2 < no + ni, one may apply Lemma 5.2 with n = n2, n' = uq and n" = ui. 
Then 712 + no — rii is the length of a palindromic prefix of w; but this length is strictly 
between no and n2, therefore it is ui. We get in this way n2 — ttitTq^tti, which concludes 
the proof of the lemma. □ 
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5.2 Ultimately Periodic Words 

Lemma 5.6 Letw be an infinite ultimately periodic word, infinitely many prefixes of which 
are palindromic. Then w is periodic with a palindromic period. Moreover, if d denotes the 
smallest length of a period of w then there exists r G {1, . . . , d} with the following property. 
For any n > d, the prefix of w of length n is a palindrome if, and only if, n = r mod d. 

Proof: If w were ultimately periodic but not periodic, there would exist two non-empty 
words TTo and tt such that w — TToTTTTTr . . ., and such that the last letter of ttq be different 
from that of tt. But this contradicts the assumption that w has arbitrary long palindromic 
prefixes. In fact, if we denote hy Zi ... Zd the word tt and by zq ^ Zd the last letter of ttq, 
then this assumption implies that the word z^-i ■ ■ - Zq appears infinitely many times in w, 
and is therefore a cyclic permutation of the period Zi . . . z^. As Zq ^ za, this is impossible. 

Therefore w is periodic, and can be written w — tttttt . . . with a period tt of minimal 
length d. Let n > dhc the length of a palindromic prefix of w. Then we have Wi — Wn+i-i 
for all i G {1, . . . , li}. If n' is another such integer, not congruent to n mod d, we obtain 
Wi = Wi+s for all i G {1, . . . , 0?} with I < e < d — 1; this contradicts the minimality of 
d. Therefore all lengths of palindromic prefixes lie in the same congruence class mod d; 
conversely it is clear that any n> d that belongs to this class is the length of a pahndromic 
prefix oi w. □ 

Example 5.7 For the word a{ahhaY — {aahhY , we have d — 4 and r — 2. 

5.3 Proof of Theorem 4.12 

Throughout the proof, we fix a function ■0, and consider the palindromes tt^ used to define 
■u;^. For any i > 1 we let — |7rj|. 

First, let us prove the easier implication (using only Lemma 5.1). Assume ip is not 
reduced, and all palindromic prefixes of are among the TTj's. There is an index A; > 1 
such that either ^p{tk) > tk-i (with '4){tk) G N*) or ip{tk) = ^(tfc_i). 

Let j = tk-i and i = t^. There are non-empty words h and h' such that tTj+i = Tijb and 
TTj+i = T^ih' . Since '^(j' -|- 1) = . . . = ■0(i — 1) = 1, we have tt^ = TTjfe^"-' for any £ G {j, . . . , i} 
and in particular tTj = '^jb'^~^ ■ Applying Lemma 5.1 to the palindromes Trjb^"^"^ and TTj 
proves that TTjfc is a palindrome. If b' = 6" with n > 2, this is a palindromic prefix of TTj+i 
(hence of w^), whose length is strictly between those of TTj and TTj+i; this contradicts the 
assumption that all palindromic prefixes of are among the tt^'s. Therefore b' cannot be 
a non-trivial power of b. 

In the case where ■0(0 £ N* and > j, we have b' — with ■0(i) < i — 2, hence 

a contradiction. 

Assume now ipi^j) = tpii) G A. Then b = ip{j)nj hence b' — il){j)'Ki — ip{j)Trjb'^~^ — 
&*~-?+^ with i — j + 1 > 2; this is again a contradiction. 
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At last, assume — £ f*^*- Then we have in the same way b — '^^(^j)'^j 

hence b' — 7r^(j)7ri = b'^~^'^^, hence a contradiction. This concludes the proof of the first 
implication in Theorem 4.12. 

Let us prove the converse now. Assume ip is reduced, and let w = w^. Let n" be the 
palindromic prefix of w of minimal length among those which are not tt^'s. Let i be the 
integer such that |7ri| < |7r"| < |7ri+i|. Since tti = s and 1:2 = V'(l) € A, we have i >2. 

Let uj be such that n" — i^iU. Then Lemma 5.4 gives an integer t > 2 such that 
TTj+i = TTiO;*; the definition of TTj+i shows that a; is a suffix of TTj. Now Lemma 5.2 (with 
n = n' = Hi and n" = |7r"|) implies that tTjCJ"^ is a palindromic prefix of w. 

Let us prove that i is among the t^s. This is obvious if G so we may assume 
ip{i) G N*. Then TTj = 7r^(i)a;* and the palindromic prefix 7iiUJ~^ has length strictly between 
|7r^(i)| and |7ri| since t > 2. By minimality of tt", this implies '^(i) < i — 2, and concludes 
the proof that there exists A; > 1 such that i — tk- 

Let j = tfc-i, and 6 be the word such that 'n'j+i = TTjb. Then we have TTj = 7rjb^~\ since 
= £ — 1 for any £ G {j + 1, . . . , i — 1}. Now we have t >2, hence 2|a;| < rij + 1 and 
Lemma 5.5 yields: 



Hi 



1 + |6| - 1 Uj + nj_i 



rij — a; > > — > — — > n,_i. 

J I I - 2 - 2 ~ 2 ^ 

Therefore the palindromic prefix tTjO;"-'^ is among tTj, 7rj_i. This shows that a; is a 
power of 6, say cu — b^. 



UJ 



First, let us assume ip{i) G ^. If we also have ■00) € «4. then b — ■^OO^Tj and ■0(i)7rj = 
5«* = {il){j)'njY* hence = G A, which is a contradiction. Now in the case 



t IMt 



^ Awe have < 'h.^.q) < rij^i < \b\ — l by Lemma 5.5. However n^Q) = rij = rii = —1 
mod |6| since ip{i)'n'i = a;*; this is again a contradiction. 

To conclude the proof, we have to consider the case where G N*. Since ip is reduced, 
this gives < t^-i = j- We see that ri^(i) = rii — t\uj\ belongs to rii + |6|Z = rij + |6|Z. 
Now nj < 2nj_i + 1 < 2\b\ thanks to Lemma 5.5, hence n^{i) = rij — This implies 
\b\ < rij, hence ■ip{j) ^ A. But we then have n^{j) = rij — \b\ = n^(i), and this equality 
contradicts the assumption that ip is reduced. 

This concludes the proof of Theorem 4.12. □ 



5.4 Proof of Theorem 4.14 

The uniqueness statement at the end of Theorem 4.14 follows from Theorem 4.12 and 
Remark 4.4 (as noticed in Remark 4.16 above). Let us prove that (i), (ii) and (Hi) in 
Theorem 4.14 are equivalent. 

The implications (Hi) =^ (ii) and (ii) =^ (i) are obvious; let us prove that (i) implies 
[iii). Assume that w satisfies nj+i < 2nj + 1 for alH > 1. We denote by tTj the palindromic 
prefix of w with length rii. 
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For any i > 1, let us define ■0(0 in the following way. If rii+i — 2ni + 1, we let e A 
be the letter in tTj+i that comes right after tTj (that is, the central letter of the palindrome 
TTj+i which has odd length). Otherwise, we apply Lemma 5.2 with n = n' = rii and 
n" = nj+i. This gives an integer ip{i) between 1 and i — 1 such that TTi+i = vriTT^^^j^TTj. 

With this construction, it is clear that w — w^. More precisely, the palindromes tTj 
are exactly those constructed using in §4.1. Since they are (by hypothesis) the only 
palindromic prefixes of w. Theorem 4.12 proves that ^0 is reduced. 

This concludes the proof of Theorem 4.14. □ 

6 Palindromic Prefix Density 

In §6.1, we show how the results on words with abundant palindromic prefixes can be 
generalized to words such that rij+i < 2ni + 1 for any sufficiently large i. This enables us in 
§6.2 to describe T) in terms of 5{il)) for reduced functions ip^ and to prove Theorem 1.2 stated 
in the Introduction. This description makes use of a technical statement (Proposition 6.2), 
the proof of which is postponed to the Appendix (Section 9). 

Given an infinite word w, we denote by (n^) the increasing sequence of all lengths of 
palindromic prefixes of w (with ni = 0). We let tTj be the palindromic prefix of w with 
length rij. 

6.1 Words with Asymptotically Abundant Palindromic Prefixes 

The following proposition is a generalization of results stated in Section 4. It enables one 
to construct all words satisfying n^+i < 2nj + 1 for any sufficiently large i. 

Proposition 6.1 Let w be a word with infinitely many palindromic prefixes, and io be an 
integer. The following statements are equivalent: 

(i) We have rij+i < 2ni + 1 for any i > io. 

(ii) There exists a reduced function ip such that, for any i > io-' 

• Either il){i) E A and tTj+i = TTiil){i)T:i, 

• Or ipli) e N* and TTj+i = TTiTr^^^.^TTj. 

This proposition means that the palindromic prefixes tt, of w satisfy (for i sufficiently 
large) the same recurrence relation as those of the word with abundant palindromic prefixes 
w^. This recurrence relation is completely determined by the function ip. 

Let -0 be a reduced function, and TTj,, be a finite word with exactly palindromic prefixes 
(including tti = e and TTi^ itself), denoted by tti, . . . , tTiq. Then using the construction of 
§4.1 (which is the same as (ii) in Proposition 6.1) for i > io one obtains an infinite word 
w with infinitely many palindromic prefixes, of which tTiq is a palindromic prefix. For this 
word we have rij+i < 2ni + 1 for any i > io, but there is no reason why this relation would 
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hold for i < io'. for instance ii io — 2, may be any palindrome whose only palindromic 

prefixes are e. its first letter and itself, so it might have length greater than 3. Moreover, 
Proposition 6.1 implies that any word w satisfying (i) can be obtained in this way. 

Proof of Proposition 6.1: The implication (ii) (i) is clear. To prove the converse, we 
define ■0(i) for i > io as in the proof of Theorem 4.14 in §5.4. For i < io, we let = a if 

1 is even, and ipli) = 6 if i is odd, where a ^ b are two elements in A (the trivial case where 
the alphabet contains only one letter is easily dealt with). All assertions in Proposition 

6.1 immediately follow, except the fact that ip is reduced. To prove this fact, one follows 
exactly the same lines as in the proof of Theorem 4.12 in §5.3. □ 

6.2 Elementary Description of P and 

To estabhsh a relationship between T> and Vq, we shall use the definitions and statements 
of §4.3, and the following (technical) proposition proved in the Appendix (§9.3): 

Proposition 6.2 Let ip be a reduced function such that 6{ip) < 2, {rii) be the associated 
sequence, and (n^)i>i be another increasing sequence of non-negative integers such that 
n'i+i = 2n[ - n^(.) for i sufficiently large. 

Then the quotient n'^/ui has a finite positive limit as i tends to infinity, and we have: 

5(V') =limsup^. 

Let w be any word such that S{w) < 2, and (nj)j>o be the sequence of all lengths of 
palindromic prefixes of w. Proposition 6.1 yields a reduced function ip such that nj+i = 
2ni — n^(i) for any sufficiently large integer i. This proves that (n^) satisfies the same 
recurrence relation as the sequence associated with ip (in Definition 4.18), but the initial 
values may be distinct. Proposition 6.2 shows that these initial values have no influence 
on limsup hence S{w) — 5{ijj). This proves that V fl [1, 2) is contained in the set of 
values S{ip) for reduced functions ip, which is exactly (see §4.3). Since Vq is obviously 
contained in V, this gives I^n[l, 2) = r'on[l, 2). Now considering a characteristic Sturmian 
word whose slope has unbounded partial quotients proves (thanks to Equation (1)) that 

2 e T>Q C T>. Therefore we have proved the following result: 

Theorem 6.3 Both Vq and I^n[l, 2] are exactly the set of values taken by S{ip) for reduced 
functions ip. 

Together with the inclusion (2, +oo] C T> proved in §2.2, this statement imphes Theo- 
rem 1.2. 
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7 Peculiar Study around \/3 



In this Section, we focus on the sets Vq and V around \/3. First, we prove Theorem 1.3 
stated in the Introduction, namely Vq n [1, \/3] = V n [1, v^] (§7.1). Next, we define and 
study ^-strict words with abundant palindromic prefixes (§7.2); this allows us to prove 
that any ^-strict word w with abundant palindromic prefixes such that 5{w) < y/S is either 
periodic or characteristic Sturmian. 

At last, we prove that Theorem 1.3 is optimal, since there is no gap in Dq right above 
\/3 (§7.3) whereas there is one in V (§7.4). 

As in the previous Section, we shall use a technical result (Lemma 7.1, stated in §7.1) 
the proof of which is postponed to the Appendix. 

Given an infinite word w, we still denote by (n,) the increasing sequence of all lengths 
of palindromic prefixes of w (with rii — 0). We let tTj be the palindromic prefix of w with 
length rii. 

7.1 Proof of Theorem 1.3 

Let be a reduced function. As in §4.2, we denote by {tk)k>o the family of all indexes n 
(in increasing order) such that either 1 < ijj(n) < n — 2 or ilj[n) e A. 

Assume that S{ip) < 2. Then e N* for any sufficiently large integer i, hence: 

• Either the family (tk) is finite, that is -0(0 — i — ^ for any sufficiently large i. This 
obviously implies S{ip) — 1. 

• Or the family (tk) is infinite, and for k sufficiently large we have ip{tk) < tk-i- 

The first condition corresponds to periodic words (see Corollary 4.13). In the second 
condition, the special case where ip{tk) = tk-i — 1 for any k >2 corresponds to characteristic 
Sturmian words (see Example 4.6). The following result shows that any reduced function 
ip with S{ip) < -\/3 is either "periodic" or "asymptotically Sturmian". 

Lemma 7.1 Let tfj be a reduced function such that 5{ip) < \/3. Then either the family (tk) 
is finite, or for any k sufficiently large we have 

i>{tk) = tk-i - 1. 

The value VS in this lemma is optimal (see the end of §7.3). We postpone the proof 
(which is completely elementary, but rather technical) to the Appendix (§9.4). 

Now let us prove Theorem 1.3 stated in the Introduction, namely H [1, Vs] = V (1 
[1, \/3]. First of all, it is readily seen that ^/3 = [1,1,2^ eV G Vq (see the beginning of 
§7.4). Let ■0 be a reduced function such that 5{'4>) < -\/3. If is periodic then Corollary 
4.13 implies 5(V') = 1 G T>' . Otherwise the associated sequence (t^) is infinite, and satisfies 
i^itk) — ijfc-i — 1 for any sufficiently large k thanks to Lemma 7.1. Let — t^ — for 
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A; > 1, and ■0' be the function defined from (sk) in Example 4.6. Proposition 6.2 shows that 
Sli/j) = S{'4)') = S{ca,), where is the characteristic Sturmian word with slope [0, Si, S2, ■ ■ ■]■ 
Thanks to Theorem 6.3, this concludes the proof of Theorem 1.3. □ 

Actually the result we have proved is slightly more precise that Theorem 1.3: for any 
non-periodic word w such that 5{w) < \/3 wc have found a characteristic Sturmian word 
Ca such that the palindromic prefixes of w and those of Ca satisfy (asymptotically) the 
same recurrence relation. In the next Section, we show that two additional assumptions 
(namely abundance of palindromic prefixes and ^-strictness) imply w = c^. 

However there is a characteristic Sturmian word w, and a non-episturmian ^-strict 
word w' with abundant palindromic prefixes, such that 5{w) = 5{w') = y/S (see the end of 
§7.3). This shows that S{ip) does not characterize a reduced function ip. 

7.2 Initial Values and Strict Words 

Let us consider (on the three-letter alphabet A — {a, b, c}) the finite word 7r4 — abacaba. 
It has four palindromic prefixes: tti = e, 7r2 = a, tts = aba and 114. It is a palindromic 
prefix of the word constructed (as in §4.1) from any function ip such that = a, 
-0(2) = b and -0(3) = c. Now if ip is given by '0(i) — i — 2 ior any i > 4 then behaves 
"asymptotically" hke the Fibonacci word (see Example 4.7); for instance S{w^) — 7. 

This word is a standard cpisturmian word, which is not Sturmian (it cannot be 
written on a two-letter alphabet), but which behaves like a Sturmian word. To avoid this 
kind of examples, one usually restricts to ^-strict standard episturmian words ([8], §4.2), 
also known as characteristic Arnoux-Rauzy words. 

Now let us turn to the (more general) case of words with abundant palindromic prefixes. 
There are words which have abundant palindromic prefixes, behave like a Sturmian word 
(as above), but are not Sturmian - and not episturmian either. We introduce the following 
definition (recall that with any word w with abundant palindromic prefixes we associate 
in Definition 4.17 its word of first letters A 5i52 ■ ■ .): 

Definition 7.2 A word w on an alphabet A, with abundant palindromic prefixes, is said 
to be A-strict if every letter in A occurs infinitely many times in the word of first letters 
ofw. 

This definition extends that of ^-strict standard episturmian words ([8], §4.2). It 
allows us to state the following result (recall from [8], Theorem 4, that ^-strict standard 
episturmian words on a two-letter alphabet A are exactly characteristic Sturmian words): 

Theorem 7.3 Let w be any non-periodic word with abundant palindromic prefixes. We 
assume w to be A-strict, and S{w) < y/S. Then A contains exactly two letters, and w is 
characteristic Sturmian. 

As a special case, we get the following result: 
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Corollary 7.4 Letw be any non-periodic A-strict standard episturmian word, with5{w) < 
Then A contains exactly two letters, and w is characteristic Sturmian. 



Proof of Theorem 7.3: Denote by (nj) the increasing sequence of all lengths of palindromic 
prefixes of and by i}) be the directive function of w. Lemma 7.1 shows that ■0(^fe) — 
tk~i — 1 for any sufficiently large k. Denote by A the word of first letters associated with 
ij) (i.e., with w). Then 5„ = dn-i if ^ is not among the t^s, and 6t^ = = St,^_.^ 

for any sufficiently large k. Therefore 5„ takes infinitely many times at most two values. 
Since w is ^-strict, A contains at most two letters. If A is reduced to a single letter then 
w is nothing but the periodic repetition of this letter. Otherwise w is a standard ^-strict 
episturmian word on a two-letter alphabet, hence is characteristic Sturmian ([8], Theorem 
4). □ 

7.3 Non-Episturmian Examples near 

The following proposition, together with Proposition 7.7 proved in §7.4, shows that the 
value Vs is optimal in Theorem 1.3. 

Proposition 7.5 There exists a decreasing sequence in Vq with limit -\/3- 

To prove this, consider for any integer n>2 the function defined as follows (with 
the two-letter alphabet A = {a, b}). 

Let 0„ : {0, . . . , 4n} ^ {1, 2, 3} be defined by: 

• 0n(O) = 3. 

• For i e {l,...,n}, 0„(i) = 2. 

• For i E {n + 1, . . . , 4n} with i = n + 1 mod 3, 0n(^) = 2. 

• For i e {n + 1, . . . , An} with i = n + 2 mod 3, 0n(O — 1- 

• For i E {n + 1, . . . , An} with i = n mod 3, (l>n{i) = 3. 

Then wc let V'n(l) = cl, '^n(2) = b and ipn{i) = i — (t>n{i') for any i > 3, where i' is the 
integer between and An which is congruent to i modulo 4n + 1. 

The word of first letters associated with is (in the case where n > 3 is odd) 



By definition, for any i > 3 wc have = ^j, with A^j = 8182 ■ ■ ■■ Moreover, if i > 3 is 

not a multiple of 4n -|- 1 then V'„(i) is the largest index i' such that = 5j. But when i is 
a multiple of 4n -|- 1, we have ■0n(O = ^ — 3 with (5i_3 = 5j_2 — 5i — b. Moreover it is easily 
checked that ipn is reduced. Therefore Theorem 4.12 shows that w^^ is not a standard 
episturmian word. 
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It is possible to prove that S{t/jn) is greater than -\/3, and tends to y/3 as n tends to 
infinity. □ 

Using the same ideas, it is possible to construct a word (which is not standard epis- 
turmian) such that S{w^) — -\/3, by choosing an increasing sequence (n^), with sufficiently 
fast growth, and building a function (f) by concatenating the functions 0„j.. This proves 
that the conclusion of Lemma 7.1 does not hold for any functions ip such that 5{ip) — -\/3. 



Remcirk 7.6 The word constructed in this way is not episturmian; however, the value 
of 5{w^), namely s/Z, equal to 5{w') for some characteristic Sturmian word w' (see the 
beginning of %7.4)- This proves that knowing 5{w) does not provide information on the 
structure ofw. 



7.4 The Sturmian Spectrum near \/3 

To prove that 

= [1,1, 2;t] = 1.7320... 

belongs to "D', it is enough to apply Equation (1) with Sj. = 1 for k even and 8^ — 2 for k 
odd. Now taking = 3 for any k yields another element of V: 

^±^ = [1,1,3] = 1.7675.... 

The following proposition proves that there is nothing inbetween: 

Proposition 7.7 There is no element ofV between ^/3 and '^'^^ ■ 

Proof: For a sequence b — (61,62, • • •) of positive integers, and a non-negative integer 
k, we let T'^b = {bk+i,bk+2, ■ ■ ■)■ Cassaigne proved in [3] that V is the set of numbers 
[1, 1, 61, 62, • • •] where the sequence b satisfies [b] > [T'^b] for any k > (where [b] is the 
continued fraction [61, 62, • • •])• Let b be such a sequence, with \/3 < [1,1, Oi, O2, . . .1 < 

6 ■ 

First of all, let us prove that bi G {1,2} for any i > 1. Indeed, the assumption 
[1, 1, 61, 62, ■ ■ ■] < [1,1,3] means [b] < [3] hence bi < 3. If fei < 2 then the assertion is 

proved (since [b] > [T^'b] for any k). Otherwise bi = 3 and bi G {1,2,3} for any i. But 
[62, &3, • • •] > [^ yields 62 > 3 hence 62 = 3. Now [b] > [Tb] gives [3, 63, • • •] < [h, • • • , ] 
hence 63 = 3. Repeating these arguments gives bi = 3 for any i, in contradiction with the 
assumption. 

Now we have [b] > \2/L\ with bi e (1, 2} for all i. This gives &i = 2 and [62, &3, . . .] < [1,2J 
hence 62 = 1- Repeating this process yields [b] — [2, IJ, that is a contradiction. 
This concludes the proof. □ 
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8 Open Questions 



This Section is devoted to open questions about words w with abundant pahndromic 
prefixes. Some of them were asked by various speciahsts I would hke to thank. 
We let w he a word with abundant palindromic prefixes. 

Do letters (and more generally factors) have frequencies in w7 Is it possible to compute 
these frequencies in terms of the directive function of w (see Definition 4.15)? 
What is the complexity of 

What are the recurrence quotient, and the critical exponent of w (in terms of ■?/')? 

What is the bound one should put instead of in Theorem 7.3 to ensure that A 
contains at most 3 letters? More generally, for any integer p one could study the set of 
values taken by S{w), for words w written on an alphabet of at most p letters. 

Are there other ways to construct the set of all words with abundant palindromic 
prefixes (as for standard episturmian words)? 

Is is possible to write words with abundant palindromic prefixes as fixed points of 
morphisms? 

Is there a way to define "non-standard" words with abundant palindromic prefixes ? 
This would be a class of words that behaves with respect to words with abundant palin- 
dromic prefixes in the same way as Sturmian words with respect to characteristic Sturmian 
words, and in the same way as episturmian words with respect to standard episturmian 
words. 

What does V look hke between the least accumulation point a^o and \/3 (see [2], §8)? 
What does T>o look hke above -\/3? In which intervals is it dense? 

9 Appendix: Study of Some Recurrence Relations 

In this Appendix, we study the linear recurrence relation satisfied by the sequence (n^) of 
all lengths of palindromic prefixes of a word w with abundant palindromic prefixes. We 
focus on asymptotic properties of this sequence (and especially on limsupnj+i/nj). The 
point of view is to forget everything about words: all statements and proofs are completely 
elementary, and rely only on the recurrence relation associated with a reduced function 

The point is to prove two technical statements used in the text: Proposition 6.2 and 
Lemma 7.1. Moreover, the tools introduced here are used in [9]. 

9.1 Definitions 

Recall from §4.1 that we consider functions -0 : N* ^ N* U ^ such that, for each n > 1: 

either ■0(n) e ^ or 1 < ■0(n) <n — l. 

With such a function we associate in §4.2 a (finite or infinite) family {tk)k>o, namely the 
family of all indexes n > 1 (in increasing order) such that either 1 < ip{n) < n — 2 or 
ip{n) e A. 
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We recall that is reduced (see Definition 4.9) if, for any k > 1, the following two 
conditions are satisfied: 

• ijj{tk) 7^ ^/^(tfc-i)- 

• Either ip{tk) G ^ or ip{tk) < tfc-i- 

In the special case where the family (tk) is finite (i.e., ilj{n) = n — 1 for n sufficiently large), 
we assume in this definition that both properties hold for any k such that tk exists. 

Moreover, with any reduced function ip we associate (as in §4.3) the increasing sequence 
of non-negative integers (nj)j>i defined by ni = and, for all i > 1: 

rij+i = 2ni - n^(i) if e N* 

and 

rii+i = 2nj + 1 if ■0(0 £ A, 

and we let ^ 

(5('0) = limsup — 
rii 

9.2 First Lemmas 

Lemma 9.1 Let ip be a reduced function, and {ni)i>i be the associated sequence. Then for 
any k > 1 we have, with i — tk-' 

rii+i > rii + ni^i. 

Proof: We proceed by induction on k. We have to = 1 hence ip{ti) e A since ip is 
reduced. Therefore njj+i = 2nt-^ + 1 > nt-^-i, which proves the result for k — 1. Assume it 
holds for k. If ■0(^fe+i) £ A then it clearly holds for A; + 1; otherwise we have: 

This concludes the proof. □ 

Lemma 9.2 Let ip be a reduced function. Then S{ip) < 2 if, and only if, there is an integer 
B such that, for any sufficiently large i: 

^{i) e N* and ^{i) >i-B. 

Proof: Denote by (rij) the sequence associated with ip. 

Assume there is an integer B be such that ■?/'(?) > i — B for all i sufficiently large. 
Then for i sufficiently large we have ip{i) < Sui hence n^(i) > 3~^nj, and therefore 
limsup nj+i/nj < 2 — 3~^. 
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Now, assume limsup < 2 — £ with £ > 0. Then obviously we have e N* for i 
sufficiently large. Moreover Lemma 9.1 yields, for k large enough: 

nt,-i < {1 - £)nt^. (6) 

Let Sk — tk — tk-i, the previous inequality yields 

nt, = nt,_, + Skints - nt^_i) > n*,^^ + SkEUt,, 

therefore Sk < 1/e: the sequence (s^) is bounded. Now for i large enough, and A such that 
< tk-x < tk < i, Equation (6) gives 

erii < rii+i - 2ni = n^(i) < (1 - e)^'^^ni 

that is, an upper bound on A. This concludes the proof of the lemma. □ 



9.3 An Independence Property 

This Section is devoted to a proof of Proposition 6.2, which means that can be defined 
using the asymptotic behavior of any solution (n'J of the associated recurrence relation: 
the initial values of the sequence {ni) do not matter. 

Let {n'-) be as in the statement of Proposition 6.2 (see §6.2). Lemma 9.2 provides an 
integer B such that ■^(i) e N* and 'ip{i) > i — B for all sufficiently large i. Let e[ = n'^^i—n[. 
The recurrence relation satisfied by {n[) yields (for i sufficiently large) 

i 

j=iP{i+l) 

This implies that {e'^) is non-decreasing for i sufficiently large, and tends to infinity as 
i tends to infinity (except in the special case where tpii) = i — 1 for any sufficiently 
large i, which is easily dealt with). Moreover, the same properties hold for the sequence 
£j = nj+i — rij. Now, let 

The relation above yields, for i sufficiently large: 

i i i 

j=ip{i+l) J=V(i+l) J=V'(i+l) 

hence 

* £'■ 
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Now let io be sufficiently large, and for i > io let li be the convex hull of ctj+i-B, . . . , ctj 
(that is, the smallest segment in that contains these points). Wc shall deduce from (7) 
the following claim (where |/| denotes the length of a segment /): 

li+B C li and \Ii+B\ < (1 - B'~^)\Ii\ for any i > io. 

If the claim holds then the intersection of all Jj's is reduced to a positive real number, which 
is the limit of the sequence (««). As e'^ tends to infinity with i, it is a classical consequence 
that 

rh _ no + £o + £i + ■ ■ ■ + £i-i 
K n'o + e'o + e[ + ... + e[_-^ 

converges to the same limit. This concludes the proof of the proposition - if the claim 
holds. 

To prove the claim, write Ii — [ctj — ai + 5f] and notice that the right handside of 
(7) is a linear combination of ctj+i-B, . . . , ctj with non- negative coefficients. Moreover, the 
coefficient of ctj is at least 1/B. Therefore bounding aj in (7) from below by ctj — 5'^ (resp. 
from above by ctj + 5^) for j ^ i shows that 

e [ai - (1 - llB)5r, «, + (!- 1/5)5+]. 

Applying this result inductively yields, for any £ > 0: 

ai+i e [ai - (1 - B-^)5-, «, + (!- 5-^5+] C /. 

This proves the claim, thereby concluding the proof of Proposition 6.2. □ 

9.4 A Special Property of a/3 

In this Section, we prove Lemma 7.1 stated in §7.1. 

Let be a reduced function such that 5{^l)) < y/S, and (n,) be the associated sequence, 
as in §9.1. For any A; > 1 and any i e {tk, ■ ■ ■ ,ifc+i — 2}, we have + 1) = i hence 
nj_|.2 — Tii+i = nj+i — Ui. Let us denote by ak the common value of n^+i — Ui for i G 
{tk, ■ ■ ■ ,tk+i — !}• Excluding the case where the family (t^) is finite, it is clear that the 
sequence (a^) is increasing. Let 6 < y/S and > 2 be such that Uj+i < 6ni (hence 
e N*) for all i > tx- 

Assume there is an index k > K such that ipltk) < tfc_i — 1. 

First of all, let us write (using Lemma 9.1 with i — tk-i) 

n^itk) = 2nt, - > (2 - 5)nt^ > (2 - 5){nt^_^ + ni,_^_i). (8) 
Substracting ak-2 — ^tfc-i ~ ^tfc-i-i from this inequality yields (since 5 >1): 

n^{t^) - ak-2 > (1 - S)nt,_, + (3 - S)nt,_,-i > (3 - 5^)nt,_,-i. (9) 
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Since 5 < y/S, the right handside of (9) is positive. Therefore Lemma 9.1 yields n^(t^) > 
ak-2 > nt^_2-i hence ip{tk) > tk-2- This inequahty, together with the assumption ip{tk) < 
tk-i - 1, yields rit^.i-i - ak-2 = "'tfc_i-2 > n^{tk)- Combining this with (8) gives 

{2S - 3)nt,_,-i > (3 - S)ak-2, (10) 

which implies, in particular, 6 > 3/2. Now we also have 

hence 

ak-2 > (2 - 5)nt^_^ > (2 - S){ak-2 + nti^_^-i). 
Combining this relation with (10) yields S > -\/3, that is a contradiction. □ 
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